Final Project

Leroy Ozores

Introduction

The FIFA World Cup! In my opinion, it is one of the most exciting tournaments in the world, and it takes place every four years. The World Cup is contested by national teams from all over the world.

The data that I will be using comes from FIFA, the “Fédération Internationale de Football Association”, which translates to International Federation of Association Football in English. The dataset covers the 900 World Cup matches played in the tournament’s history, from Uruguay in 1930 to Russia in 2018. I obtained it from Kaggle; it includes the top finishers of each tournament, the stage of the tournament, the home and away teams, the outcome of each match, and more. My approach is to analyze all of the World Cup tournament matches and draw insights from them. My goal is to see if I can uncover information in the data that isn’t obvious to viewers.

When was the first match?

## # A tibble: 2 × 6
##    year stage   date       home_team away_team     outcome
##   <dbl> <chr>   <date>     <chr>     <chr>         <chr>  
## 1  1930 Group 1 1930-07-13 France    Mexico        H      
## 2  1930 Group 4 1930-07-13 Belgium   United States A

When was the most recent match?

## # A tibble: 1 × 6
##    year stage date       home_team away_team outcome
##   <dbl> <chr> <date>     <chr>     <chr>     <chr>  
## 1  2018 Final 2018-07-15 France    Croatia   H
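The two lookups above can be reproduced with a filter on the date column. This is only a minimal sketch: it assumes dplyr and a data frame that I call `matches` here (a hypothetical name), filled with three rows copied from the outputs above rather than the full 900-row dataset.

```r
library(dplyr)

# Hypothetical sample: three rows copied from the outputs shown above
matches <- tibble(
  year      = c(1930, 1930, 2018),
  stage     = c("Group 1", "Group 4", "Final"),
  date      = as.Date(c("1930-07-13", "1930-07-13", "2018-07-15")),
  home_team = c("France", "Belgium", "France"),
  away_team = c("Mexico", "United States", "Croatia"),
  outcome   = c("H", "A", "H")
)

# Earliest match day: ties are kept, so both opening-day games are returned
first_matches <- matches %>% filter(date == min(date))

# Most recent match in the data
last_match <- matches %>% filter(date == max(date))
```

Filtering on `min(date)` instead of taking the first row keeps every match played on the opening day, which is why the first result has two rows.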

Attendance Rate

How popular is the FIFA World Cup? How many people attended? Which year had the highest attendance?

year  host     attendance  games  winner
1994  USA         3568567     52  Brazil
2014  Brazil      3441450     64  Germany
2006  Germany     3367000     64  Italy
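A ranking like the one above can be sketched as follows. The data frame name `cups` and the derived column `avg_per_game` are my own assumptions for illustration; the sample holds only the three rows shown in the table.

```r
library(dplyr)

# Hypothetical sample: the three tournaments listed in the table above
cups <- tibble(
  year       = c(1994, 2014, 2006),
  host       = c("USA", "Brazil", "Germany"),
  attendance = c(3568567, 3441450, 3367000),
  games      = c(52, 64, 64),
  winner     = c("Brazil", "Germany", "Italy")
)

# Sort by total attendance (highest first) and add the average crowd per game
cups_ranked <- cups %>%
  mutate(avg_per_game = attendance / games) %>%
  arrange(desc(attendance))
```

The per-game average is worth a look because tournaments differ in the number of games: 1994 leads in total attendance with 52 games, while 2014 and 2006 each had 64.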

Ranking from Highest to Lowest

FIFA World Cup Winner

Each tournament is exciting because no one knows which national team will take the title. I don’t know which team has won the most titles. Let’s have a look.
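One way to count titles is sketched below, assuming dplyr and a per-tournament data frame with a `winner` column. The name `cups` is hypothetical, and the sample contains only three tournaments mentioned in this report, so every team shows a single title here; the full list of tournaments would be needed for the real answer.

```r
library(dplyr)

# Hypothetical sample: winners of three tournaments mentioned in this report
cups <- tibble(
  year   = c(1994, 2006, 2014),
  winner = c("Brazil", "Italy", "Germany")
)

# Count titles per team, most titles first
titles <- cups %>% count(winner, name = "titles", sort = TRUE)
```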

Most Goals

Every match is exciting, especially when your team scores. Everyone stands up screaming “Gol!”, bumping into each other and even spilling drinks on our neighbors. How many goal screams are there per game?

## # A tibble: 900 × 16
##     year stage   home_team away_team    home_score away_score Total_Gols country
##    <dbl> <chr>   <chr>     <chr>             <dbl>      <dbl>      <dbl> <chr>  
##  1  1930 Group 1 France    Mexico                4          1          5 Uruguay
##  2  1930 Group 4 Belgium   United Stat…          0          3          3 Uruguay
##  3  1930 Group 2 Brazil    Yugoslavia            1          2          3 Uruguay
##  4  1930 Group 3 Peru      Romania               1          3          4 Uruguay
##  5  1930 Group 1 Argentina France                1          0          1 Uruguay
##  6  1930 Group 1 Chile     Mexico                3          0          3 Uruguay
##  7  1930 Group 2 Bolivia   Yugoslavia            0          4          4 Uruguay
##  8  1930 Group 4 Paraguay  United Stat…          0          3          3 Uruguay
##  9  1930 Group 3 Uruguay   Peru                  1          0          1 Uruguay
## 10  1930 Group 1 Argentina Mexico                6          3          9 Uruguay
## # ℹ 890 more rows
## # ℹ 8 more variables: city <chr>, outcome <chr>, win_conditions <chr>,
## #   winning_team <chr>, losing_team <chr>, date <date>, month <chr>,
## #   dayofweek <chr>
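The `Total_Gols` column in the output above is the sum of the two score columns. A minimal sketch, assuming dplyr and a hypothetical data frame `matches` holding three of the rows shown (rows 1, 2 and 10):

```r
library(dplyr)

# Hypothetical sample: three of the 1930 matches shown above
matches <- tibble(
  home_team  = c("France", "Belgium", "Argentina"),
  away_team  = c("Mexico", "United States", "Mexico"),
  home_score = c(4, 0, 6),
  away_score = c(1, 3, 3)
)

# Goals per match = home goals + away goals
matches <- matches %>% mutate(Total_Gols = home_score + away_score)

# Average number of goals per game in this sample
avg_goals <- mean(matches$Total_Gols)
```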

How many goals were scored in each tournament year?
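Goals per tournament year can be computed by grouping on `year`. The sketch below assumes dplyr and uses the ten 1930 matches printed earlier as a hypothetical sample called `matches`, so the result has a single row; with the full dataset there would be one row per tournament.

```r
library(dplyr)

# Hypothetical sample: total goals in the ten 1930 matches printed earlier
matches <- tibble(
  year       = rep(1930, 10),
  Total_Gols = c(5, 3, 3, 4, 1, 3, 4, 3, 1, 9)
)

# One row per year: goals scored, matches played, and goals per match
goals_by_year <- matches %>%
  group_by(year) %>%
  summarise(
    goals           = sum(Total_Gols),
    matches         = n(),
    goals_per_match = goals / matches
  )
```

Dividing by the number of matches makes years comparable, since the number of games per tournament has changed over time.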

Which team has scored the most goals?

## # A tibble: 81 × 2
##    home_team    home_score
##    <chr>             <dbl>
##  1 Brazil              177
##  2 Argentina           114
##  3 West Germany         93
##  4 France               85
##  5 Italy                72
##  6 Germany              65
##  7 England              50
##  8 Hungary              49
##  9 Spain                49
## 10 Belgium              45
## # ℹ 71 more rows
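Judging by its column names, the table above sums `home_score` by `home_team`, so it seems to count only the goals a team scored when listed as the home side. A possible way to include away goals as well is sketched below, assuming dplyr and a hypothetical sample `matches` built from four of the 1930 rows shown earlier.

```r
library(dplyr)

# Hypothetical sample: four 1930 matches from the output shown earlier
matches <- tibble(
  home_team  = c("France", "Argentina", "Chile", "Argentina"),
  away_team  = c("Mexico", "France", "Mexico", "Mexico"),
  home_score = c(4, 1, 3, 6),
  away_score = c(1, 0, 0, 3)
)

# Stack home and away goals into one long table, then total them per team
goals_by_team <- bind_rows(
  matches %>% transmute(team = home_team, goals = home_score),
  matches %>% transmute(team = away_team, goals = away_score)
) %>%
  group_by(team) %>%
  summarise(goals = sum(goals)) %>%
  arrange(desc(goals))
```

In this sample Mexico has no home goals at all, so a home-only total would leave it out even though it scored four times as the away team.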

Matches during the week

Which teams have made it to the semifinals?

## # A tibble: 36 × 5
##     year stage      home_team      away_team     outcome
##    <dbl> <chr>      <chr>          <chr>         <chr>  
##  1  1930 Semifinals Argentina      United States H      
##  2  1930 Semifinals Uruguay        Yugoslavia    H      
##  3  1934 Semifinals Czechoslovakia Germany       H      
##  4  1934 Semifinals Italy          Austria       H      
##  5  1938 Semifinals Brazil         Italy         A      
##  6  1938 Semifinals Hungary        Sweden        H      
##  7  1954 Semifinals Austria        West Germany  A      
##  8  1954 Semifinals Hungary        Uruguay       H      
##  9  1958 Semifinals Brazil         France        H      
## 10  1958 Semifinals Sweden         West Germany  H      
## # ℹ 26 more rows
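The listing above shows matches, not teams. To get one row per team with its number of semifinal appearances, something like the following could work. It assumes dplyr and a hypothetical sample `matches` made of three semifinals from the output above plus the 2018 final, included to show the stage filter.

```r
library(dplyr)

# Hypothetical sample: three semifinals shown above plus the 2018 final
matches <- tibble(
  year      = c(1954, 1958, 1958, 2018),
  stage     = c("Semifinals", "Semifinals", "Semifinals", "Final"),
  home_team = c("Austria", "Brazil", "Sweden", "France"),
  away_team = c("West Germany", "France", "West Germany", "Croatia")
)

# Keep only semifinal matches
semis <- matches %>% filter(stage == "Semifinals")

# A team can appear as home or away, so stack both columns before counting
semi_teams <- bind_rows(
  semis %>% transmute(year, team = home_team),
  semis %>% transmute(year, team = away_team)
) %>%
  count(team, name = "appearances", sort = TRUE)
```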

Which